Setting HTML Spider Task Settings
|
Previous Top Next |
· | Specify the extensions of the files that should be regarded as HTML pages.
|
· | You can also choose whether to download stylesheets for Web-pages and pages from other sites.
|
· | By default pages are saved with *.htm extensions. If by some reason you consider this inconvenient you can uncheck the corresponding checkbox.
|
· | Depth of downloading, i.e. the maximal quantity of passages from the start page to downloaded pages.
|
· | The Ignore list button lets you specify a list of URLs that should be ignored when downloading a web site, making it possible to download only desired parts of a site.
|
· | Download images (checked by default)
|
· | Download images that link to other sites
|
· | Elimination of images by extension (after checking this checkbox you can choose whether to download images with appointed extensions or not)
|
· | Download files
|
· | Download files from other sites
|
· | Eliminate files by extension (after checking this checkbox you can choose whether to download files with appointed extensions or not)
|